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COMPOUNDS 



Technical Field 



5 This invention relates to cyclopropyl indoles (CPI) (which term 
is used to encompass cyclopropylbenzindoles (CBI)) compounds 
and their precursors, to methods of synthesizing these 
compounds on solid supports, and to compounds of utility 
therein. This invention further relates to collections of 
10 these compounds, and methods for identifying and isolating CPI 
and precursor compounds with useful and diverse activities 
from such collections. 



Background to the invention 

15 

A large number of both synthetic and naturally occurring low 
molecular weight ligands are known that interact with DNA via 
a number of different mechanisms, including covalent or non- 
covalent interaction in the minor or major grooves, 
20 intercalation between base pairs or other types of non- 
specific interactions . 

Of the class of ligands which interact with the minor groove, 
GC specific ligands include Chromomycin, pyrrolo[2,l- 
25 c] [1, 4] benzodiazepines (PBDs) , Mitomycins and Ecteinascidins. 
Of these, all but Chromomycin form a covalent bond with the 
DNA. Of AT specific ligands, cyclopropylindoles form covalent 
bonds, whilst compounds such as distamycin and netropsin do 
not . 

30 

Cyclopropylindole (CPI) compounds are a class of highly potent 
antitumour antibiotics which includes the natural products 
CC-1065 (V.L. Reynolds et al, J. Antibiot., 39, 1986, 319-314) 
and the duocarmycins (D.L. Boger, Pure & Appl . Chem., 66, 



1994, 837-844), having IC 50 s in the low pM range. They are of 
the general structures A and B: 




OH O 
A B 



5 

Studies with compounds that model the binding subunit have 
shown that the more stable open chain seco-precursors (e.g. A) 
are as potent as the cyclopropylindole compounds (e.g. B) . 
Further, ring closure is not essential for DNA binding. 

10 

A number of synthetic analogues of the natural products have 
been prepared in which the oxygen of A is protected as a 
carbamate that must be cleaved (by non-specific enzymatic 
hydrolysis) for activity. Further analogues of a similar 
15 type are disclosed in WO88/04659 and W091/16324. Analogues 

where the 6 substituent is N or S are disclosed in WO 97/07097 
and WO 98/11101 . 

Compounds having biological activity can be identified by 
20 screening collections of compounds (i.e. libraries of 

compounds) produced through synthetic chemical techniques. 
Such screening methods include methods wherein the library 
comprises a plurality of compounds synthesized at specific 
locations on the surface of a solid support where a receptor 
25 is appropriately labelled to identify binding to the compound, 
e.g., fluorescent or radioactive labels. Correlation of the 
labelled receptor bound to the support with its location on 
the support identifies the binding compound (US 5,143,854). 



Central to these methods is the screening of a multiplicity of 
compounds in the library and the ability to identify the 
structures of the compounds which have a requisite biological 

act i^it-y. Tn order to facilitate synthesis and 

5 identification, the compounds in the library are typically 
formed on solid supports. Usually each such compound is 
covalently attached to the support via a cleavable or 
non-cleavable linking arm. The libraries of compounds can be 
screened either on the solid support or as cleaved products to 

10 identify compounds having good biological activity. 



Disclosure of the Invention 

The present invention provides CPI compounds with structures that 
15 allow them to be joined to combinatorial chains, as well as 
combinatorial libraries containing CPIs themselves. 

A first aspect of the present invention relates to compounds of 
formula I : 




Y 



wherein X is an electrophilic leaving group; 

Y is selected from NH-Prot, O-Prot, S-Prot, N0 2 , NHOH, N 3 , NHR, 
25 NRR, N=NR, N(0)RR, NHS0 2 R, N=NPhR, SR or SSR, where Prot 
represents a protecting group; 

A and B collectively represent a fused benzene or pyrrole ring 
(in either orientation), which is optionally substituted by up 
to respectively 4 or 2 groups independently selected from R, 
30 OH, OR, halo, nitro, amino, Me 3 Sn, C0 2 H, C0 2 R; 



4 



R, is a nitrogen protecting group, where if Y includes a 

protecting group, these protecting groups are orthogonal; 

R 2 and R 7 are independently selected from H, R, OH, OR, halo, 

nitro, — amino, — M e 3 5n; 

5 wherein R is selected from 

(a) a lower alkyl group having 1 to 10 carbon atoms, 

(b) an aralkyl group (i.e. an alkyl group with one or more 
aryl substituents ) , preferably of up to 12 carbon atoms; 
the alkyl group of (a) or (b) optionally containing one or 

10 more carbon-carbon double or triple bonds, which may form part 
of a conjugated system; and 

(c) an aryl group, preferably of up to 12 carbon atoms; and 
wherein R is optionally substituted by one or more halo, 
hydroxy, amino, or nitro groups, and optionally contains one 

15 or more hetero atoms, which may form part of, or be, a 
functional group; 

except that when R a is Boc, Y is N0 2 , X is CI, and R 2 and R 7 are 
H, then A and B do not collectively represent either an 
unsubstituted benzene ring or: 

20 

Me0 2 C 



a 



30 




If R is an aryl group, and contains a hetero atom, then R is 
heterocyclic group. If R is an alkyl chain, and contains a 
25 hetero atom, the hetero atom may be located anywhere in the 
alkyl chain, e.g. -0-C 2 H 5 , -CH 2 -S-CH 3/ or may form part of, or 
be, a functional group, e.g. carbonyl, hydroxy. 



R is preferably independently selected from a lower alkyl 
group having 1 to 10 carbon atoms, or an aralkyl group, 
preferably of up to 12 carbon atoms, or an aryl group, 
preferably of up to 12 carbon atoms, optionally substituted by 
one or more halo, hydroxy, amino, or nitro groups. It is more 



preferred that R is independently selected from lower alkyl 
groups having 1 to 10 carbon atoms optionally substituted by 
one or more halo, hydroxy, amino, or nitro groups. It is 
particularl y r rpfprrpH tliat B Ls an unsubsti tuted straight or 
5 branched chain alkyl group, having 1 to 10, preferably 1 to 6, 
and more preferably 1 to 4, carbon atoms, e.g. methyl, ethyl, 
propyl, butyl. 

These compounds are useful in the synthesis of collections of 
10 CBI and CPI precursors. Compounds of formula I (including 

those in the proviso) can be attached to a solid support, e.g. 

via a connecting link which may comprise a chain of 

combinatorial units. This is a further aspect of the 

invention, i.e. the use of compounds of formula I in methods 
15 of combinatorial chemistry synthesis, wherein the compound of 

formula I is joined to a solid support by a chain comprising 

at least one combinatorial unit. 



20 



The term 'protecting group' (and more specifically 'nitrogen 
protecting group' ) has the meaning usual in synthetic 
chemistry, particularly for 'nitrogen protecting group' , in 
synthetic peptide chemistry. It means any group which may be 
covalently bound to the protected atom of the CBI or CPI 
grouping, and permits reactions to be carried out upon the 
25 molecule containing this grouping without its removal. 

Nevertheless, it is able to be removed from the protected atom 
without affecting the remainder of the molecule. Suitable 
protecting groups for the present invention include Fmoc 
(9-f luorenylmethoxycarbonyl) , Nvoc 
30 (6-nitroveratryloxycarbonyl) , Teoc 

(2-trimethylsilylethyloxycarbonyl) , Troc 

(2,2, 2-trichloroethyloxycarbonyl) , Boc ( t-butyloxycarbonyl ) , 
CBZ (benzyloxycarbonyl) , Alloc ( allyloxycarbonyl ) , Psec 
(2(-phenylsul P honyl)ethyloxycarbonyl), (t-butyl ether), Benzyl 
35 ether, Silyl ether, MOM (methoxy methyl ether) , MEM (2-methoxy 




6 



ethoxy methyl ether) or acetate. Other suitable groups are 
described in Protective Groups in Organic Synthesis, T Green 
and P Wuts, published by Wiley, 1991, which is incorporated 
herein Dy reference. 

5 

It is preferred that the nitrogen protecting group has a 
carbamate functionality where it binds to the nitrogen atom to 
be protected. 

10 The term ^orthogonal ' in relation to 'protecting groups' has 
the meaning usual in synthetic chemistry. It means that one 
protecting group may be selectively removed without affecting 
the other protecting group. This is achieved by using 
protecting groups which are sensitive to different removal 

15 conditions. 

Y is preferably NH-Prot, O-Prot, S-Prot, and more preferably 
NH-Prot . 

20 It is preferred that X is either halogen or OS0 2 R, where R is 
as defined earlier. A halogen group means a fluoro, chloro, 
bromo or iodo group. It is more preferred that X is CI. 

It is further preferred that the 4,5 fused ring is substituted 
25 by -C0 2 R in the 2 or 3 position if it is a the benzene ring, 
and the 2 position if it is a pyrrole ring, e.g. 




where FU and R 7 are preferably H, with the proviso that Y is 
not N0 2 when R a is Boc, R 2 and R 7 are H, and X is Cl. 

A c^rond n.sppr t of the present invention relates to compounds 

5 of the formula III: 



15 




R 7 



(III) 



wherein X, Y, A, B, R 2 and R 7 are as defined in the first 
aspect; 

10 T is a combinatorial unit; 

and n is a positive integer, where if n is greater than 1, 
each T may be different; 

L is a linking group, or less preferably a single bond; 
and O is a solid support. 



A third aspect of the present invention relates to compounds 
of formula III' : 



(III') 



20 where A, B, R 2/ R 7 , T, n, L and O are as defined in the second 
aspect; and 
Y' is NH, O or S. 



Compounds of this aspect are obtainable from compounds of the 
25 second aspect by removal of the protecting group preferably 
carbamate in Y (if present), or by other appropriate 
reactions, e.g. reduction and/or basic conditions. The 



cyclisation may also occur spontaneously. A fourth aspect of 
the present invention relates to compounds of formula II: 





XX 



10 



15 



20 



25 



(II) 



wherein X, Y, A, B, R 2 , R 7/ T and n are as defined in the 
second aspect of the invention. 

Compounds of this aspect are obtainable by cleavage of the 
linking group of the appropriate compound of the second 
aspect . 

A fifth aspect of the present invention relates to compounds 
of the formula II' : 




on 



wherein A, B, T, n, R 2 and R 7 are as defined in the fourth 
aspect of the invention; and 
Y' is NH, NR, 0, or S . 

Compounds of this aspect are obtainable by cleavage of the 
linking group of the appropriate compound of the third aspect 
(see below) . Alternatively, they are obtainable from 
compounds of the fourth aspect by removal of the protecting 
group in Y (if present), or by other appropriate reactions, 
e.g. reduction and/or basic conditions. The cyclisation may 
also occur spontaneously. 



A sixth aspect of the present invention relates to compounds 
of formula V: 




(V) 



where A, B, Y, R x , R 2 , R 7 , and X are defined in the first 
aspect of the invention (including the excluded compounds) T, 
n, L and O are as defined in the second aspect of the present 
invention. 



10 



15 



20 



A seventh aspect of the present invention relates to compounds 
of formula V : 



II A 



B 




(V) 



where A, B, R a , R 2 , R„ T, n, L and O are as defined in the 

sixth aspect; and 

Y' is NH, NR, O or S. 

Compounds of this aspect are obtainable from compounds of the 
sixth aspect by removal of the protecting group in Y (if 
present), or by other appropriate reactions, e.g. reduction 
and/or basic conditions. The cyclisation may also occur 
spontaneously . 



25 



An eighth aspect of the present invention relates to compounds 
of formula IV: 



10 



H-(-T-hr N 




(IV) 



wherein A, B, X, Y, T, n, Rj, R 2 and R 7 are as defined in the 
sixth aspect of the invention. 



Compounds of this aspect are obtainable by cleavage of the 
linking group of the appropriate compound of the sixth aspect. 

A ninth aspect of the present invention relates to compounds 
10 of formula IV : 



wherein A, B, T, n, R a/ R 2 and R 7 are as defined in the eighth 
aspect of the invention; and 
15 Y' is NH, NR, O, or S. 

Compounds of this aspect are obtainable by cleavage of the 
linking group of the appropriate compound of the seventh 
aspect (see below) . Alternatively, they are obtainable from 
2 0 compounds of the eighth aspect by removal of the protecting 
group in Y (if present), or by other appropriate reactions, 
e.g. by reduction and/or basic conditions. The cyclisation 
may also occur spontaneously. 



5 




25 



A tenth aspect of the invention is the preparation of 
compounds according to either the second or sixth aspects by 
reaction of a compound of formula VI: 




11 




L-f-T^w (VI) 



with a compound of formula I according to the first aspect, 
where T, n, L and O are as defined in these aspects, and W is 
5 H or an atom or group for providing a functional group capable 
of reaction with -COOH or -NH 2 . This reaction will include the 
necessary protection and deprotection steps so as to 
selectively achieve a compound according to the second or 
sixth aspect. 

0 

An eleventh aspect of the invention relates to compounds of 
formula VII: 



15 wherein: 

O, T, and L are as defined in the second aspect of the 
invention; 

n and m are positive integers, or one of them may be zero; 
T 1 is a combinatorial unit, where each T 1 may be different if 
20 m is greater than 1; 

T fl is a combinatorial unit which provides a site for the 

attachment of D; 

D is selected from: 



D 



(VII) 



13 



wherein X' is selected from CO, NH, S, or 0/ 

G is 0, S, NH, or a single bond; 

FC 2 and R' 3 are — radep-e-nde n tly se l e cted from: H, — — QH-, — QR-, — =0, 

5 =CH-R, =CH 2 , CH 2 -C0 2 R, CH 2 -C0 2 H, CH 2 -S0 2 R, 0-S0 2 R, C0 2 R, COR and 

CN, and there is optionally a double bond between C 2 and C 3 ; 

R' 6 , R f 7 / and R ' 9 are independently selected from H, R, OH, OR, 

halo, nitro, amino, Me 3 Sn; 

R' n is either H or R; 
10 Q' is S, 0 or NH; 

R' 10 is a nitrogen protecting group; 

Y" is a divalent group such that HY = R; 

p is a positive integer, where if p is greater than 1, for 
each repeating unit, the meaning of T, T' , T" and D and the 
15 values of n and m are independently selected; and 

E is selected from the same possibilities as D, provided that 
at least one group D or E is selected from (a) . 

If, for example, D terminates with CO then the site on T" may 
20 be NH, and if D terminates with NH, S or 0, then the site on 
T" may be CO. 

A twelfth aspect of the invention relates to. compounds of 
formula (VIII) : 



H—fT-fe-T-t-Hrr 6 (VI") 



wherein L, T, T' , T", D, E, n, m and p are as defined in the 
eleventh aspect of the invention. 



A thirteenth aspect of the invention relates to compounds of 
formula (IX) : 



14 



H 



4- 

T'» 



T' 



Q— 4^ r -^-<- T -klr E - 



(IX) 



wherein O, L, T, T' , T", n, m and p are as defined in the 
5 eleventh aspect of the invention; 
T' " is a combinatorial unit/ 

q is a positive integer, where if q is greater than 1, each 
T' " may be different; and 

E a is selected from the group (a) of E as defined in the 
10 eleventh aspect of the invention; 

where if p is greater than 1, for each repeating unit the 
meaning of T, T' , T", T"' and the values of n, m and q are 
independently selected. 

15 A fourteenth aspect of the invention relates to compounds of 
formula (X) : 



20 wherein L, T, T' , T", T"' , E a/ n, m, p and q are as defined in 
the thirteenth aspect of the invention. 

It is recognised that the compounds of formulae I, II, II', 
III, III', IV, IV, V, V, VII, VIII, IX and X may exist in 
25 different enantiomeric or diast ereomeric forms. In such cases 
it is to be understood that the above formulae represent any 
possible enantiomeric or diastereomeric form or a mixture 
thereof . 



H 



T 




(X) 




Solid support 

The term 'solid support' refers to a material having a rigid 

— o-r—s- Bmi-xiyr d—s-ti-rf-atre-- w h ic h contains or can b e derivatiz e d to — 

5 contain reactive functionalities which can serve for 

covalently linking a compound to the surface thereof. Such 
materials are well known in the art and include, by way of 
example, silicon dioxide supports containing reactive Si-OH 
groups, polyacrylamide supports, polystyrene supports, 
10 polyethyleneglycol supports, and the like. Such supports will 
preferably take the form of small beads, pins/crowns, laminar 
surfaces, pellets, disks. Other conventional forms may also 
be used. 

15 Linker group 

The linking groups suitable for the present application are 
ones which usually provide in the structure: 



20 




at least one covalent bond which can be readily broken by 
specific chemical reactions (or by light or changes in pH) 
thereby providing for liberation of compounds free from the 
solid support. The methods employed to break the covalent 

25 bond are selected so as to be specific for the desired bond 
breakage thereby preventing unintended reactions from 
occurring elsewhere on the complex. The linking group is 
selected relative to the synthesis of the compounds to be 
formed on the solid support so as to prevent premature 

30 cleavage of this compound from the solid support as well as 
limit interference by any of the procedures employed during 
compound synthesis on the support. 




16 



Examples of resins incorporating cleavable linking groups are 
set out in the table below, which also indicates the groups 
that can be immobilised thereon, along with the suggested 

cleavage method s for the linki rrg— gr-oup-^ Such re sins are 

5 commercially available (e.g. from NovaBiochem) . 



15 



T,i nker/Resin Tvoe 


Immobilises 


Cleavage 
Method 


2-Chlorotrityl chloride 


RNH 2 , RC0 2 H, 
ROH, RSH 


1-50% TFA 


Trityl chloride 


RNH 2 , RC0 2 H, 
ROH, RSH 


1-5% TFA 


2-Methoxytrityl chloride 


RNH 2/ RC0 2 H, 
ROH, RSH 


1-5% 


Rink amide resin 


RC0 2 H 


95% TFA 


Sieber amide resin 


RC0 2 H 


1% TFA 


4-Sulfamyl- benzoyl 


RC0 2 H 


Alkylation 
/amines 


Wang resin 


ROH, ArOH, 
RNH 2 , RCO ? H 


15-95% TFA or DDQ 
or CAN 


HMPB-BHA 


ROH, ArOH, 
RC0 2 H 


1% TFA 


Bromoethyl photolinker 


RNH 2/ RC0 2 H, 
ROH, RSH 


hv 


Hydroxy ethyl photolinker 


RC0 2 H 


hv 


Aminoethyl photolinker 


RC0 2 H 


hv 



17 



Structures 





U = CT! ^-cniorotntyl chloride type 

D = H: trityl chloride type 

D = OMe: 2-methoxytrityl chloride 



[RESIN 



-Ci 




OMe NHFMOC 




MeCr _ ^ . 0 _^RES,N] 

NHFMOC 



Rink amide type 





RESIN] 4-sulfamyl-benzoyl type 




HMPB-BHA type 



D = NH 2 :amino-ethyl type 
D = OH: hydroxy-ethyl type 
D = Br: bromo ethyl type 
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For CPI precursors the most preferred linking group is one 
which may be deemed photolyt ically . Further the Rink amide 

linker is parLiculaxly suilable. 

5 

It is also possible that the linking group is a simple 
functionality provided on the solid support, e.g. amine, and 
in this case the linking group may be not be readily 
cleavable. This type of linking group is useful in the 

10 synthesis of large split and mix libraries which will be 

subjected to on-bead screening (see below), where cleavage is 
unnecessary. Such resins are commercially available from a 
large number of companies including NovaBiochem, ChemTech and 
Rapp Polymere. These resins include amino-Tentagel and 

15 aminomethylated polystyrene resin. 

Combinatorial Unit 

The term "combinatorial unit' means any monomer unit which can 
20 be used to build a chain attached to the solid support, 

usually by a linking group. Examples of molecules suitable 
for such chain building are found in Schreiber et al . (JACS, 
120. 1998, pp. 23-29), which is incorporated herein by 
reference. An important example of a unit is an amino acid 
25 residue. Chains may be synthesised by means of 

amine-protected amino acids. Fmoc protected amino-acids are 
available from a number of sources, such as Sigma and Nova 
Biochem. Both natural and unnatural amino acids can be used, 
e.g. D- and L-amino acids and heterocyclic amino acids. In 
30 particular, heterocyclic amino acids of the type found in the 
construction of netropsin and distamycin are of interest 
because of their DNA-recognition properties. 

Amine units can be used to make up peptoids: see Soth, M.J. 
35 and Nowick, J.S. 1997, Unnatural oligomer libraries, Curr. 




Opin, Chem. Biol. 1, no. 1: 120-129; Zuckermann et al., 1994, 
Discovery of Nanomolecular Ligands for 7-Transmembrane 
G-Protein-Coupled Receptors from a Diverse 

N _ (5ub 5 titu4^e^-g4 yc - in e Pept o id L ib rary, — Journal of Medicinal — 

5 Chemistry 37: 2678-85; Figliozzi, GMR et al., 1996, Synthesis 
of N-substituted Glycine Peptoid Libraries, Methods in 
Enzymology, 267 : 437-47; Simon, R J et al . , 1992, Peptoids: A 
Modular Approach to Drug Discovery, Proc. Natl. Acad. Sci . 
USA, 89:9367-71; which are all incorporated herein by 

10 reference . 

Other combinatorial units include PNAs (peptidonucleic acids) : 
PE Nielsen, et al, Science, 1991, 254, 1497; M Egholm, et al, 
Nature, 1993, 365, 566; M Egholm et al , JACS, 1992, 114, 1895; 

15 S C Brown, et al , Science, 1994, 265, 777; 5. K Saha, et al r 

JOC r 1993, 58, 7827; oligoureas: Burgess K, et al , 1995, Solid 
Phase Synthesis of Unnatural Biopolymers Containing Repeating 
Urea Units. Agnew. Chem. Int. Ed. Engl 34, no. 8:907; Burgess 
K, et al, 1997, Solid Phase Synthesis of Oligoureas; Journal 

20 of the American Chemical Society 119: 1556-64; and 

oligocarbamates: Moran E J et al, 1995, Novel Biopolymers for 
Drug Discovery. Biopolymers (Peptide Science); John Wiley and 
Sons 37: 213-19; Cho C Y et al , 1993, An Unnatural Biopolymer. 
Science 261: 1303-5; Paikoff S F et al, 1996, The Solid Phase 

25 Synthesis of N-Alkylcarbamate Oligomers. Tetrahedron Letters 
37, no. 32: 5653-56. All of these documents are incorporated 
herein by reference. 

A further aspect of the present invention relates to 
30 combinatorial units having formula I, where the fused ring 
collectively represented by A and B is substituted by C0 2 R. 



20 

Further combinatorial units of relevance to this invention are 
those of formulae (XIa/b) : 




5 

wherein R' 3 / R' s/ R'?# R' 9/ G and Y' are as defined in the 
eleventh aspect of the invention, and G' and Y" are independently 
selected from the possible groups for G and Y' respectively. In 
order to synthesise combinatorial chains containing such 

10 combinatorial units, the units may need to be joined to the chain 
in their protected form (see definition of groups D(b) above) . 
It is possible that the combinatorial units may remain in their 
protected form until the compound is cleaved from the solid 
support, or until the other components in the compound are 

15 deprotected. 

The present invention relates to libraries, or collections, of 
compounds, all of which are represented by a single one of the 
formulae II, II', III/ III' f IV, IV, V, V , VII, VIII, IX or 

20 X. The diversity of the compounds in a library may reflect 
the presence of compounds differing in the identities of one 
or more of (as appropriate) X, Y, Y' , A, B, R 2 , R 7/ X', Y" , G, 
R' 2 , R' 3 / R'e/ RS/ R' 9 , R'10/ Q' • R'n and/or in the identities 
of the combinatorial units T, T' , T" and T' " (when present). . 

25 The number of members in the library depends on the number of 
variants, and the number of possibilities for each variant. 
For example, if it is the combinatorial units which are 
varied, and there are 3 combinatorial units, with 3 
possibilities for each unit, the library will have 27 

30 compounds. 4 combinatorial units and 5 possibilities for each 




unit gives a library of 625 compounds. If, for instance, 
there is a chain of 5 combinatorial units with 17 
possibilities for each unit, the total number of members in 

the libra ry wnniH 1.4 million. A library may therefore 

5 comprise more than 1 000, 5 000, 10 000, 100 000 or a million 
compounds, which may be arranged as described below. 
In the case of free compounds (formulae II, II', IV, IV, VIII, 
X) the individual compounds are preferably in discrete volumes 
of solvents, e.g. in tubes or wells. In the case of bound 

10 compounds (formulae III, III', V, V , VII, IX) the individual 
compounds are preferably bound at discrete locations, e.g. on 
respective pins/crowns or beads. The library of compounds may 
be provided on a plate which is of a suitable size for the 
library, or may be on a number of plates of a standard size, 

15 e.g. 96 well plates. If the number of members of the library is 
large, it is preferable that each well on a plate contains a 
number of related compounds from the library, e.g. from 10 to 
100. One possibility for this type of grouping of compounds is 
where only a subset of the combinatorial units, or substituents, 

20 are known and the remainder are randomised; this arrangement is 
useful in iterative screening processes (see below) . The 
library may be presented in other forms that are well-known. 

A further aspect of the present invention is a method of 
25 preparing a diverse collection, or library of compounds, as 
discussed above. If the diversity of the library is in the 
combinatorial units, then the library may be synthesised by the 
stepwise addition of protected combinatorial units to a CPI/CBI 
precursor core, each step being interposed by a deprotection 
30 step. Such a method is exemplified later. Libraries- of this 
type can be prepared by the method known as "split and mix" 
which is described in Furka, A; Sebestyen, F; Asgedom, M and 
Dibo, G; General Method of Rapid Synthesis of Multicomponent 
Peptide Mixtures; International Journal of Peptide and Protein 
35 Research; 1991, 37, 487-193, which is incorporated herein by 
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reference. If the diversity of the library is in the 
substituent groups, the library may be synthesised by carrying 
out the same synthetic methods on a variety of starting 

ma Le i i als or key intermediates, — which alr e ady possess the 

5 necessary substituent patterns. 

The present invention also relates to a method of screening the 
compounds of formula II, II' , III, III'/ IV, IV, V, V, VII, 
VIII, IX or X to discover biologically active compounds. The 
screening can be to assess the binding interaction with nucleic 
acids, e.g. DNA or RNA, or proteins, or to assess the affect of 
the compounds against protein-protein or nucleic acid-protein 
interactions, e.g. transcription factor DP-1 with E2F-1, or 
estrogen response element (ERE) with human estrogen receptor (a 
66 kd protein which functions as hormone-activated transcription 
factor, the sequence of which is published in the art and is 
generally available) . The screening can be carried out by 
bringing the target macromolecules into contact with individual 
compounds or the arrays or libraries of individual compounds 
described above, and selecting those compounds, or wells with 
mixtures of compounds, which show the strongest effect. 

This effect may simply be the cytotoxicity of the compounds in 
question against cells or the binding of the compounds to 
25 nucleic acids. In the case of protein-protein or nucleic 

acid-protein interactions, the effect may be the disruption of 
the interaction studied. 

The binding of the compounds to nucleic acids may be assessed by 
30 labelling oligomers which contain a target sequence, and 

measuring the amount of labelled oligomers that bind to the 
compounds tested. The labelling may either be radiolabelling, 
or alternatively be labels detectable under visible or 
ultra-violet light. If this latter form of screening is carried 
35 out on compounds bound to solid supports which are in separate 




locations, the screening for results can be carried out visually 
under a microscope. A similar technique is described in detail 
in DNA-Binding ligands from peptide libraries containing 

unnatur -a-1— e-m^o a c ids, L e ^crinier et *1 — , — Chem Ear J, — 1 998 , 

5 425-433. These techniques are particularly suited to a one-step 
screening of a complete library of compounds, especially a large 
library made by the "split and mix" method described above. 

Protein-protein interactions can be measured in a number of 
10 ways, e.g. FRET (fluorescence resonance energy transfer) which 
involves labelling one of the proteins with a fluorescent donor 
moiety and the other with an acceptor which is capable of 
absorbing the emission from the donor; the fluorescence signal 
of the donor will be altered depending on the interaction 
15 between the two proteins. Another method of measuring 

protein-protein interactions is by enzymatic labelling, using, 
for example, horseradish peroxidase. 

The screening process may undergo several iterations by 
20 selecting the most active compounds, or group of compounds, 

tested in each iteration; this is particular useful when testing 
arrays of wells which include mixtures of related compounds. 
Furthermore, if the wells contain compounds for which only a 
subset of the combinatorial units, or substituents, are known, 
25 but the rest are randomised, subsequent iterations can be 

carried out by synthesising compounds possessing the selected 
known (and successful) combinatorial unit, or substituent, 
pattern, but with further specified combinatorial units, or 
substituents, replacing the previously randomised combinatorial 
30 units, or substituents, adjacent the already known pattern; the 
remaining combinatorial units, or substituents, are randomised 
as in the previous iteration. This iterative method enables the 
identification of active members of large libraries without the 
need to isolate every member of the library. 



35 
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A further feature of this aspect is formulation of selected 
compound or compounds with pharmaceutically acceptable carriers 
or diluents. 

5 In yet further aspects, the invention provides a pharmaceutical 
composition comprising a compound of formula II, II' , IV, IV, 
VIII or X and a pharmaceutically acceptable carrier or diluent; 
and the use of a compound of formula II, II', IV, IV, VIII or X 
in the manufacture of a medicament for the treatment of a 
10 gene-based disease, or a bacterial, parasitic or viral 

infection. Gene-based disease include neoplastic disease and, 
for example, Alzheimer's disease. 

Compounds of formula II, II', IV, IV, VIII or X may be used in 
15 a method of therapy against a gene-based disease, such as 
cancer or Alzheimer's disease, or a viral, parasitic or 
bacterial infection . 

Another aspect of the present invention relates to the use of 
compounds of formula III, III', V, V, VII or IX in diagnostic 
methods. A compound of formula III, III'/ V, V', VII or IX 
which binds to an identified sequence of DNA or a protein known 
to be an indicator of a medical condition can be used in a 
method of diagnosis. The method may involve passing a sample, 
e.g. of appropriately treated blood or tissue extract, over an 
immobilised compound of formula III, III', V, V, VII or IX, 
for example in a column, and subsequently determining whether 
any binding of target DNA to the compound of formula III, III', 
V, V, VII or IX has taken place. Such a determination could 
be carried out by passing a known amount of labelled target DNA 
known to bind to compound III, III', V, V, VII or IX through 
the column, and calculating the amount of compound III, III' , 
V, V' , VII or IX that has remained unbound. 



«• » • 
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A further aspect of the present invention relates to the use of 
compounds of formula II, II', IV, IV, VIII or X in target 
validation. Target validation is the disruption of an 

i-d entif i c d -^N A seq u ence to — a sce rta in the funct i on o f th e 

5 sequence, and a compound of formula II, II', IV, IV, VIII or X 
can be used to selectively bind an identified sequence, and 
thus disrupt its function. 

Another aspect of the present invention relates to the use of 
10 compounds of formulae II, II', IV, IV , VIII or X in functional 
genomics to ascertain the biological function of genes, by 
blocking this biological action. 
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Preferred Synthetic Strategies 

Compounds of formula I can be synthesised by applying the 
methods described below. A review of methods of synthesising 
CPIs was carried out by Boger (Boger, D.L. et al . , J. A. Chem. 
Rev. 1997, 97, 787-828) 



Boger Synthesis of N-BOC-CBI 

Selective C4 iodination of N-BOC-4- (benzyloxy ) naphthylamine, 
readily accessible in three steps from the commercially 
available 1 , 3-dihydroxynaphthalene, followed by N-alkylation 
25 with allyl bromide provided the required substrate for the 5- 
exo-trig aryl radical-alkene cyclization (Boger, D.L. et al . , 
O. J. Am. Chem. Soc. 1989, III, 6461-6463. Boger, D.L. at al . , 
0. J. Org. Chem. 1990, 55, 5823-5832) (Scheme 1). Treatment with 
Bu 3 SnH-TEMPO (Boger, D.L.; McKie, J. A. J . Org. Chem. 1995, 3, 
1429-1453) and subsequent reduction with Zn afforded the 3- 
(hydroxymethyl) indoline CBI precursor. Conversion to the 
primary chloride and catalytic hydrogenolysis of the benzyl 
ether, followed by direct resolution on a semipreparat ive 
chiral HPLC column afforded both enantiomers (Boger, D.L.; Yun, 
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W. J. Am. Chem. Soc . 1994, 116, 7996-8006). Subsequent 
spirocyclization completed the synthesis. 



Scheme "1" 

5 




Reagents and conditions: a: NH 3/ BOC 2 0; BnBr; b: NIS; c: 
AllylBr, NaH; d: Bu 3 SnH, TEMPO; e: Zn; f: Ph 3 P-CCl 4 ; g: H 2 , Pd- 
10 C; h: NaH 

Cava synthesis of CBI 

Cava's route to CBI uses a heterocyclization procedure 
involving a heterostilbene to establish the tricyclic CBI core, 
15 followed by introduction of an additional f unct ionalized carbon 
necessary for formation of the cyclopropane (Drost, K.J.; Cava, 
M.P. J. Org. Chem. 1991, 56, 2240-2244) (Scheme 2). 



m 
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Scheme 2 
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Reagents and conditions: a: 1) t-BuOK; 2) p-nitrobenzoic acid, 
Et 3 N, Pd-C, hn; b: 1) NaBH 3 CN; 2) C1C0 2 CH 2 CC1 3 ; c: 1) DDQ; 2) 
NaBH4; d: Me 2 NH, CH 2 0; e: CH 3 I; NaCN; f: NaOH; g: 1) Me0 2 CCl, 

EtryN; 2) NaB II 3 C N; — 3^— TCBOC - C1 , Et 3 N; h: NaOH; i : 1) — (COCl) 2 ; 2) 

5 mercaptopyridine-N-oxide, DMAP, CC1 4 ; j: BCl 3 -SMe 2 ; Et 3 N 

Thus, photolysis of the alkene formed from condensation of the 
a-methoxybenzyl phosphonate with N-benzylpyrrole-2- 
carboxaldehyde in the presence of Pd-C provided the tricylic 
10 CBI core (Rawal, V.H.; Jones, R.J.; Cava, M.P. J. Org. Chem. 

1987 , 52, 19-28) . A four step debenzylat ion sequence followed 
by a regioselective Mannich alkylation was employed to provide 
the key CBI intermediate. 



15 Aristoff* synthesis of CBI (Scheme 3) 

An alternative approach to the CBI subunit was described by 
Aristoff and coworkers (Aristoff, P. A.; Johnson, P.D.; Sun, D. 
J. Med. Chem. 1993, 36, 1956-1963. Aristoff, P. A,; Johnson, 
P.D. J. Org. Chem. 1992, 57, 6234-6239). l-Allyl-2- 

20 (benzylamino) -1-hydroxydihydronaphthalenone was prepared in two 
steps from 1 , 4-naphthoquinone . Reduction and re-aromati zat ion 
was accomplished by treatment with BOC 2 0 followed by sodium 
dithionite. The racemic diol was prepared by Os0 4 -catalyzed 
dihydroxylation. Deprotection of the benzylamine, - and O- 

25 acetylation, acetate hydrolysis, followed by selective 

mesylation of the primary alcohol and TMS ether protection of 
the secondary alcohol preceded 6-membered ring closure upon 
treatment with NaH. Alcohol deprotection and resolution by 
chromatographic separation of the diastereomeric (R) -O- 

30 acetylmandelate esters provided the optically active materials. 




Reagents and conditions: a: BnNH 2 ; b: Allyl MgBr; c: BOC 2 0; 
Na 2 S 2 0<; d: 0s0 4 , NMO; e: HC0 2 H, Pd-C; Ac 2 0; f: K 2 C0 3 ; g: TMSC1, 

py i idine; h : Nail/" — 3 ; K 2 C0 3 ; k: — ( R) -O-acotylmandelic acid, EDCI; 

5 Resolution; K 2 C0 3 ; i: MsCl, Et 3 N; 1: TFA; m: NaH 

Primary alcohol activation, BOC deprotection, and transannular 
cyclization upon treatment with NaH provided the CBI 
accompanying hydrolysis of the intermediate N-Ac-CBI by water 
10 present in the reaction mixture. 

Any of these three routes may be adapted to synthesise 
compounds according to the first aspect of the present 
invention, for example by starting with appropriately 
15 substituted starting materials (which may be protected) , or by 
introducing substituents at a later stage. 

Preferred Synthetic Strategies of 
pvrrolo f 2 , 1-cl f 1 , 4 1 benzodiazepines 

20 A key step in a preferred route to compounds corresponding to 
the group D(b) of the eleventh aspect of the invention or 
combinatorial units of formulae XIa and Xlb is a cyclisation 
process to produce the B-ring, involving generation of an 
aldehyde (or functional equivalent thereof) at what will be the 

25 11-position, and attack thereon by the pro-10-nitrogen : 




In this structure, D represents XY, or a masked form thereof. 

The "masked aldehyde" -CPQ may be an acetal or thioacetal 

feps^ ibly c ycl i c) , in whic h c ase the cyclisation involves 

5 unmasking. Alternatively, the masked aldehyde may be an 

aldehyde precursor such as an alcohol -CHOH, in which case the 

reaction involves oxidation, e.g. by means of TPAP or DMSO 

(Swern oxidation) . 



10 The masked aldehyde compound can be produced by condensing a 
corresponding 2-substituted pyrrolidine with a 2-ni trobenzoic 
acid: 




15 

The nitro group can then be reduced to -NH 2 and protected by 
reaction with a suitable agent, e.g. a chlorof ormate, which 
provides the removable nitrogen protecting group in the desired 
compound . 



A process involving the oxidation-cycli zation procedure is 
illustrated in scheme 4 (an alternative type of cyclisation 
will be described later with reference to scheme 5) . 





Scheme 4 



5 If R'n is other than hydrogen, the final compound may be 

prepared by direct etherif ication of the alcohol. Compounds 
with Q T = S can be prepared by treatment of the corresponding 
alcohol with R f n SH, and a catalyst (usually a Lewis Acid such 
as A1 2 0 3 ) . If Q T = NH, then these compounds can be prepared by 
10 reacting the alcohol with an amine R ? ai NH and a catalyst 
(usually a Lewis Acid) . 

Exposure of the alcohol B (in which the 10-nitrogen is 
generally protected as an amide carbamate) to 

15 tetrapropylammonium perruthenate (TPAP) /N-methylmorpholine N- 
oxide (NMO) over A4 sieves results in oxidation accompanied by 
spontaneous B-ring closure to afford the desired product. The 
TPAP/NMO oxidation procedure is found to be particularly 
convenient for small scale reactions while the use of DMSO- 

20 based oxidation methods, particularly Swern oxidation, proves 
superior for larger scale work (e.g. > 1 g) . 

The uncyclized alcohol B may be prepared by the addition of a 
nitrogen protecting reagent of formula D, which is preferably a 
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chlorof ormate or acid chloride, to a solution of the amino 
alcohol C, generally in solution, generally in the presence of 
a base such as pyridine (preferably 2 equivalents) at a 

moderate Lempeiatu re (e.g. at 0°C) . Under these conditions 

5 little or no O-acylation is usually observed. 

The key amino alcohol C may be prepared by reduction of the 
corresponding nitro compound E, by choosing a method which will 
leave the rest of the molecule intact. Treatment of E with tin 
10 (II) chloride in a suitable solvent, e.g. refluxing methanol, 
generally affords, after the removal of the tin salts, the 
desired product in high yield. 

Exposure of E to hydrazine/Raney nickel avoids the production 
15 of tin salts and may result in a higher yield of C, although 

this method is less compatible with the range of possible C and 
A-ring substituents . For instance, if there is C-ring 
unsaturation (either in the ring itself, or in R 2 or R 3 ) , this 
technique may be unsuitable. 

20 

The nitro compound of formula E may be prepared by coupling the 
appropriate o-nitrobenzoyl chloride to a compound of formula F, 
e.g. in the presence of K 2 C0 3 at -25°C under a N 2 atmosphere. 
Compounds of formula F can be readily prepared, for example by 
25 olefination of the ketone derived from L-trans-4-hydroxy 

proline. The ketone intermediate can also be exploited by 
conversion to the enol triflate for use in palladium mediated 
coupling reactions . 

30 The o-nitrobenzoyl chloride is synthesised from the o- 

nitrobenzoic acid (or alkyl ester after hydrolysis) of formula 
G, which itself is prepared from the vanillic acid (or alkyl 
ester) derivative H. Many of these are commercially available 
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and some are disclosed in Althuis, T. H. and Hess, H. J., J . 
Medicinal Chem, 20(1), 146-266 (1977). 



Alternative Cvclisation (Scheme 5) 

5 




R'3 



Scheme 5 

In scheme 4, the final or penultimate step was an oxidative 
10 cyclisation. An alternative approach, using thioacetal 

coupling/unmasking, is shown in scheme 5. Mercury-mediated 
unmasking causes cyclisation to the desired compound. 

The thioacetal intermediates may be prepared as shown in scheme 
15 2: the thioacetal protected C-ring [prepared via a literature 
method: Langley, D.R. & Thurston, D.E., J . Organic Chemistry, 
52, 91-97 (1987)] is coupled to the o-ni trobenzoic acid (or 
alkyl ester after hydrolysis) G using a literature procedure. 
The resulting nitro compound cannot be reduced by 
20 hydrogenation, because of the thioacetal group, so the tin (II) 
chloride method is used to afford the amine. This is then N- 




protected, e.g., by reaction with a chlorof ormate or acid 
chloride, such as p-ni trobenzylchlorof ormate . 

_ A cet a ] -coj^a4-n 4ng Or ings — & an be u ^s ed as an alt e rnative — in this 

5 type of route with deprotection involving other methods 

including the use of acidic, or perhaps Lewis Acid, conditions. 

In the above synthesis schemes, the derivatisation of the A- 
ring is shown as being complete before the compounds are 

10 attached to the solid support. This is preferred is the 

substituents are groups such as alkoxy or nitro. On the other 
hand, substituent groups such as alkyl or alkenyl could be 
added to the A-ring after the coupling of the compound to the 
solid support. This may be achieved by R' 6 , R%, or R f 9 being 

15 easily replaceable groups, such as a halogen atom. 

An alternative synthesis approach to those detailed above is to 
protect the Pro N10 position on the component which will form 
the A-ring before joining the component which will form the C- 
20 ring. 



Embodiments of the present invention will now be described by 
way of example. 





• 
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Example 1 : CBI bound to TG-Carboxy resin 

Activation of TG-Carboxv resin with 3-hydroxv- 1 , 2 , 3- 
benzotriazin-4 (3H) -one (HO-Dhbt) 



o 



5 



O-CHD-coch, Q^JJ- 




TG Carboxy resin (Nova-Biochem, 80 mg 0.020 iumol) was suspended 
and swollen in DMF (1 mL) for 10 minutes. DMF was removed by 
suction and the resin treated with diisopropyl carbodiimide 
(DIC) (34 mitiL, 10 eq) in of DMF (1 mL) . The mixture was shaken 
10 for 15 minutes at -15°C (acetone-ice bath), then rinsed with 
DMF (2 x 1 mL) . 

The resin was suspended in DMF ( 1 mL) , cooled to -10°C and 
treated with HO-Dhbt (33 mg, 10 eq) . The mixture was shaken for 
15 30 minutes at -10°C / followed by 4 hours shaking at 0°C and was 
then allowed to stand at 0°C overnight. 

The reaction mixture was filtered and the resin washed with DMF 
(2 x 1 mL) , dichloromethane (2x1 mL) , MeOH (2x1 mL) and 
20 diethyl ether (1 mL) . The activated resin was stored in 
freezer until ready for use. 




• 
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Coupling of O-Dhbt activated TG-Carboxv resin with seco-CBL in 
presence of DPEA 



5 Seco-CBI has a very limited storage time, and for this reason 
BOC-seco-CBI is deprotected immediately before coupling to the 
activated resin. BOC-seco-CBI (20 mg 0.06 mmol) was dissolved 
in HC1 in anhydrous ethyl acetate (3M, 2.8 mL) . The mixture 
was kept at 0°C for 30 minutes and then allowed to warm to room 
10 temperature over 30 minutes. TLC (4:1 petroleum ether : diethyl 
ether) revealed complete reaction; the reaction mixture was 
evaporated under reduced pressure and stored at -20°C. 

O-Dhbt activated resin (40 mg) was treated with seco-CBI dissolved in 
15 DMF (1 mL) . DPEA (3 eq, 7.7 mg, 10 mmL) was added to the suspension 
and the mixture was shaken for 30 minutes at room temperature. The 
reaction mixture was filtered and the resin was washed with DMF (1 mL) , 
dichloromethane (1 mL) , methanol (1 mL) and diethyl ether (2x1 mL) . 
After thorough evaporation of solvents the resin was stored at -20°C 





20 
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Example 2: seco-CBI bound to TG-Carboxy resin 

Coupling of O-Dhbt activated TG-Carboxy resin with .seco-CBI, in 
absence of base 



HO-Dhbt 



5 




BOC-seco-CBI (20 mg, 0.06 mmol) was dissolved in HC1 solution 
in anhydrous ethyl acetate (3M, 2.8 mL) . The mixture was kept 
at 0°C for 30 minutes and then allowed to warm to room 
temperature over 30 minutes. TLC (4:1 = petroleum 
10 ether : diethyl ether) revealed complete reaction; the reaction 
mixture was evaporated at reduced pressure and directly used 
for the coupling without further purification. 

O-Dhbt-activated resin (see Example 1) (40 mg, 0.01 mmol) was 
15 treated with seco-CBI dissolved in DMF (1 mL) ; the mixture was 
agitated for 1 hour at room temperature. The reaction mixture 
was then filtered and the resin was washed with DMF (1 mL) , 
dichloromethane (1 mL) , methanol (1 mL) and diethyl ether (2x1 
mL) . After thorough evaporation of excess solvent the resin 
20 was stored at -20°C. 
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Example 3 - Synthesis or a CBI - hexapeptide library 

General Procedures 

G eneral p rocedure f or acetylation/endcapping 

5 

After each coupling step the resin was treated with a mixture 
of pyridine (30%) and acetic anhydride (20%) in 
dichloromethane, to acetylate any free amino groups that had 
not been coupled to an amino acid. In this way the formation 
10 of undesirable oligopeptides (carrying less than the expected 
number of amino acids) could be avoided. 

The resin was treated with the acetylating reagent (3 mL) and 
the slurry was agitated at room temperature for 1 hour. The 
reagents were then removed by filtration and the resin was 
15 rinsed with dichloromethane (2x5 mL) and methanol (2x5 mL) . 

General procedure for Fmoc deprotection 

The resin was treated with a solution of piperidine in DMF 
(20%, 3 mL) . The mixture was then agitated for 2 hours at room 
20 temperature. Excess solvent was then removed by. suction and 

the resin was rinsed with DMF (2x5 mL) , dichloromethane (2 x 
5 mL) and methanol (2x5 mL) . 

Library Generation 

25 Fmoc-Glu-OAll coupled to NovaSyn TG resin 

NovaSyn TG amino resin (0.345g, load 0.29 mmol/g, 0.1 mmol) was 
suspended and swollen in DMF (2 mL) , under agitation (1000 rpm) 
for 30 minutes. Excess solvent was then removed and a solution 
of Fmoc-Glu-OAll (0.123g, 0,3 mmol, 3 eq) , TBTU (0,096g, 0,3 

30 mmol, 3 eq) and DPEA (0,052 mL, 0,3 mmol, 3 eq) in DMF (3 mL) 
was added to the swollen resin. The resulting mixture was 
agitated at 1000 rpm at room temperature, overnight. The 
reaction mixture was filtered and the resin rinsed with DMF (2 
x 2 mL) , dichloromethane (2x5 mL) and methanol (2x5 mL) . 
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The resin was then acetylated (see General procedure for 
acetylation) and deprotected (see General procedure for Fmoc 
deprotection) . 

5 HO-Gly-Fmoc coupled to P-Glu (OA11) -H 

HO-Gly-Fmoc , , 

- |"peg l -GlutOAID-Fmoc ► ()-£p|G>G!u(CAIt)-Gly-Fmoc 

>- (^)-Cp1^ 

The resin was suspended in DMF (2 mL) and allowed to swell for 
10 30 minutes at room temperature, accompanied by agitation (1000 
rpm) . Excess DMF was removed and a solution of Fmoc-Gly-OH 
(0.089 g, 0.3 mmol, 3 eq) , TBTU (0.096 g, 0.3 mmol, 3 eq) and 
DPEA (0.052 mL, 0.3 mmol, 3 eq) in DMF (3 mL) was added to the 
resin. The mixture was allowed to shake at room temperature 
15 for 12 hours. Excess reagents were then removed by filtration 
and the resin was rinsed with DMF (2x5 mL) , dichlorome thane 
(2x5 mL) and methanol (2x5 mL) . The resin was then 
acetylated (see General procedure for acetylation) and 
deprotected (see General procedure for Fmoc deprotection) to 
20 afford the resin-bound dipeptide P-Glu (OA11 ) -Gly-H . 

Split & Mix procedure for the resin bound hexapeptide 
P-Glu (OA11) -Gly-XjX 2 X 3 X 4 -H sublibrary 



TBTU, 


pyr 


DPEA 


x3 




piperidine 



The resin was suspended in 3:1 mixture of 1 , 2-dichloroethane 
(DCE) and DMF and equally partitioned into 17 4mL Alltech 
tubes. Each tube thus contained 0.1/17 mmol = 5.88 10" 6 mol of 
30 resin-bound dipeptide. Excess solvent was removed in vacuo, . 




and the resin was suspended in DMF (200 mL) and agitated for 30 
minutes. The 17 amino acids (1.76 10 -5 mmol, 3 eq for each 
step, 7.04 10" 5 mmol for 4 steps) were weighed into 17 vials: 



5 


1 . 


Fmoc-Ala-OH - 


Z Z 


mg 




2 . 


Fmoc-Asn-OH 


Z D 


mg 




3 . 


Fmoc-Asp (OtBu) -OH 


2 9 


mg 




4 . 


Fmoc-Gln-OH 


2 6 


mg 




5 . 


Fmoc-Glu (OtBu) -OH 


30 


mg 


10 


6 . 


Fmoc-Gly-OH 


z 1 


mg 




7 . 


Fmoc-Ile-OH 


2 o 


mg 




8 . 


Fmoc-Leu-OH 


25 


mg 




9. 


Fmoc-Lys (BOC) -OH 


33 


mg 




10. 


Fmoc-Met-OH 


26 


mg 


15 


11 . 


Fmoc-Phe-OH 


27 


mg 




12 . 


Fmoc-Pro-OH 


24 


mg 




13 . 


Fmoc-Ser (tBu) -OH 


27 


mg 




14 . 


Fmoc-Thr (tBu) -OH 


28 


mg 




15. 


Fmoc-Trp (BOC) -OH 


37 


mg 


20 


16. 


Fmoc-Tyr (tBu) -OH 


32 


mg 




17. 


Fmoc-Val-OH 


24 


mg 



Each amino acid was dissolved in DMF (2 mL) ; an aliquot of each 
solution (0.5 mL, corresponding to 1.76 10" 5 mmol, 3 eq of each 
25 amino acid) was added to the appropriate tube. TBTU (1.76 10~ 5 
mmol x 17 = 2.99 10"\ 96 mg) and DPEA (1.76 10" 5 mmol x 17 = 
2.99 10" 4 , 52 mL) were separately dissolved in DMF (1.7 mL) and 
each solution was evenly distributed, delivering 3 eq of each 
reagent, to each one of the 17 tubes. 

30 

The reaction tubes were agitated at room temperature for 12 
hours, then the reagents and solvents were removed in vacuo and 
the resin was rinsed with DMF (2 x 1 mL each tube), DCM (2x1 
mL each tube) and methanol (2 x 1 mL each tube) . The resin was 
35 then suspended in 3 : 1 mixture of 1 , 2-dichloroethane and DMF and 
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recombined . The recombined resin was acetylated (3 mL of 
acetylating reagent, 1 hour, room temperature) and deprotected 
(3 mL of 20% piperidine in DMF, 2 hours, room temperature) . 

5 The procedure was repeated 3 more times. At the end of the 4 th 
amino acid coupling the deprotection step was not executed. 

Deorotection of Allvl ester 



N . . PhSiH- ^-v , 

1 0 (y\ PEG| -Glu(OAII)-Gly-X 1 X 2 X 3 X 4 Fmoc Pd [p Ph ] 4 * C J~ \ PEG h Giu ( QH )" G| y~ X i X ? X 3 X ^ Fmoc 



The resin was suspended in DCM (2 mL) , phenylsilane (2.4 mmol, 
24 eq, 0.29 mL) in DCM (1 mL) of was added and the mixture was 
shaken at room temperature for 10 minutes. 

A catalytic amount of Pd[.PPh 3 ] 4 (0.01 mmol, 0.1 eq, 11 mg) in 
DCM (0.5 mL) was added and the reaction mixture was shaken for 
further 10 minutes. 



20 The reagents were filtered and the resin was rinsed with DCM (2 
x 5 mL) and methanol (2x5 mL) . The procedure was repeated 
once again and the resin was finally dried under reduced 
pressure. 

2 5 Activation of resin with 3-hydroxv-l , 2 , 3-benzotriazin-4 ( 3H) - 
one (HO-Dhbt ) 

OH O-Dhbt 

I DIC.-15-C — ^ | 

( )_ rPEG~l -dlu-Glv-XiX?X^XiFmoc ( j-OIE^GIu-GlyOC^XaX^Fmoc 

HO-Ohbt, -15"C V — ' 



30 The resin was suspended and swollen in DMF (2 mL) for 10 

minutes. Excess DMF was removed and the resin was treated with 
a solution of diisopropyl carbodiimide (DIC) (156 mL 1 mmol, 10 
eq) in DMF (2 mL) . The resulting slurry was shaken for 15 



minutes at -15°C (acetone-ice bath), then washed with DMF (2 x 
2 mL) . 

The resin was resuspended in DMF (2 mL) , cooled no -lCru and 
5 treated with HO-Dhbt (163 mg 1 mmol, 10 eq) . The mixture was 
shaken for 30 minutes at -10°C, followed by 4 hours at 0°C and 
then allowed to stand at 0°C overnight. 

The reaction mixture was filtered and the resin washed with DMF 
10 (2x2 mL) , dichloromethane (2x2 mL) , MeOH (2x2 mL) and 
diethyl ether (2 mL) . The activated resin was stored in 
freezer until required for use. 

Coupling of O-Dhbt activated resin with seco-CB L in presence 
15 of DPEA 

O-Dhbt CBI 
(^-□SZh^ lu-G 'y -X 1 X 2 X 3 X 4 Fm oc CB> ' ° PEA > Q-l h^lu-G ly -X 1 X 2 X 3 X«F m oc 

BOC-seco-CBI (see example 2) was deprotected immediately before 
coupling to the activated resin. BOC-seco-CBI (100 mg, 0.3 
mmol) was dissolved in a solution of HC1 in anhydrous ethyl 
acetate (3M, 15 mL) . The mixture was cooled at 0°C for 30 
minutes and then allowed to warm to room temperature over 30 
minutes. TLC (4:1= petroleum ether : diethyl ether) revealed 
complete reaction; the reaction mixture was evaporated under 
reduced pressure and stored at -20°C. 

O-Dhbt-activated resin was treated with seco-CBI dissolved in 
DMF (2 mL) and the mixture was shaken for 60 minutes at room 
30 temperature. The reaction mixture was filtered and the resin 
was washed with DMF (2x2 mL) , dichloromethane (2x2 mL) , 
methanol (2x2 mL) and diethyl ether (2x1 mL) . The resin 
was dried under reduced pressure and stored in a cool, dry 
place . 
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t-Butvl- and BOC- deprotection on side chains 
The resin was treated with a solution of TIS/TFA in 
dichloromethane (2%, 2 mL) . The mixture was shaken for 2 hours 
at room temperature. Excess reagents were removed in vacuo and 
5 the resin was rinsed with dichloromethane (2x5 mL) and 
methanol (2x5 mL) . 



Fmoc deprotection 

10 CB , CBI 

(^^_TPEG | -G lu -G ly -X 1 X 2 X 3 X 4 F m oc _ piperidine ^ (^)_ Q5|g— |_ G lu -G »y -X^XaX^H 



The resin was treated with of a solution of piperidine in 
DMF(20% / 3 mL) , and agitated at room temperature for 2 hours. 
The mixture was filtered and the resin was washed with DMF (2 x 
5 mL) , dichloromethane (2x5 mL) and methanol (2x5 mL) . The 
15 resin was then dried under reduced pressure. 

Example 4: Synthesis of A CBI Combinatorial Building B lock: 7^ 

carboxv-1 , 2 . 9 . 9a-tetrahydrocvclopropa fclbenzo [el indol-4-one 

20 The synthesis proceeds from a modified Stobbe 

condensation/Friedel-Crafts acylation for generation of the 
functionalised precursor, followed by 5-exo~t-rig aryl radical- 
alkene cyclization . 

25 Wadsworth-Horner-Emmons condensation of 3-bromo-benzaldehyde 
with the Sargent phosphonate predominantly provides the E- 
isomer, which in turn undergoes acid-catalyzed deprotection and 
Friedel-Craf ts acylation (Scheme 6) . 
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Scheme 6 



o 




| R = NC 9 H 16 

1 - R = H 

5 Reagents and conditions: a: NaH, Sargent phosphonate; b: TFA; 
c: 1) Ac 2 0-KOAc; 2) K 2 C0 3 ; 3) BnBr, K 2 C0 3 ; d: GuCN; e: 1) LiOH; 
2) DPPA, t-BuOH; f: NIS; g: allyl Br, NaH; h: Bu 3 SnH, TEMPO; i: 
Zn-HOAc; j: Ph 3 P-CCl 4 ; k: NaOH 

10 Aromatic nucleophilic substitution, ester hydrolysis and 
Curtius rearrangement effected by treatment with DPPA are 
followed by regioselecti ve C4 iodination and N-alkylation with 
allyl bromide. The aryl radical-alkene cyclization by means of 
TEMPO as radical trap, as described in Boger synthesis of CBI, 

15 provides the tricyclic system that, after conversion to the 
primary chloride and base-catalyzed hydrolysis of the cyano 



# 
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group, should give the desired building block for a 
combinatorial library . 



ExamplT"5T~Bcreeninq of ~C*B1 QTi bead a a a i 1 1 s L DNA sequence 



10 



15 



The CBI on bead synthesised in example 1 was screened against 
rhodamine labelled double stranded DNA with the sequence: 
Label-5 1 -GCG TAA AAA CGC-3 1 . 

For comparison, indoline was added to O-Dhbt-activated TG 
Carboxy resin (as prepared in example 1) to yield a comparative 
resin that does not covalently interact with DNA. 

o 



chsh^q 



The resin alone was also tested to provide a background 
measurement. The screening protocol used was: 

1. Weigh out approximately 1 mg of resin into an Eppendorf. 

20 2. Incubate the resin with annealed rhodamine labelled double 
strand DNA for 24 hours at 37 °C. 
3. After 2 4 h incubation wash the resin 4 times and re- 
suspend beads in 50 mL of TE (EDTA and Tris buffer) or 
PBS (phosphate buffered saline) . 

25 4. Resin was transferred to a black 96 well plate with 

transparent base, fluorescence was measured from below at 
635 nm using a Tecan Spectrofluor (590 nm excitation, 635 
nm emission) . 



30 



Resin 


Relative Fluorescence Units 


TG Carboxy Resin 


892 


Indoline Bound to Resin 


625 


CBI Bound to Resin . 


2947 



Only a small amount of DNA binding was observed with the acid 
resin and with the resin loaded with an inactive indoline CBI 
mimic. An almost five fold differential was observed between 
-tire— €fi-I — r-esd-n — a-nd"I-nidc)d^^^-^^^d^-^^mon^-tr^a"t ing DNA bindir rg- 
the resin bound CBI. It is anticipated that a higher 
concentration of DNA would lead to a higher differential 
between the resins as it is possible that the CBI-resin is not 
saturated with DNA at this concentration. 



Example 6: Synthesis of a CBI - PBD - hexapeptide library 

General procedure for acetvlation/endcappinq 

After any coupling step the resin was treated with a mixture of 
pyridine (30%) and acetic anhydride (20%) in dichloromethane, 
to acetylate any free amine that has not been coupled to an 
amino acid. The formation of unexpected oligopeptides 
(carrying less than the expected number of aminoacids) is 
avoided . 

The resin was treated with 3 mL of acetylating reagent and the 
slurry was agitated at room temperature for 1 hour. The 
reagents were then removed by filtration and the resin was 
rinsed with dichloromethane (2x5 mL) and methanol (2x5 mL) . 

General procedure for Fmoc deprotection 

The resin was treated with 3 mL of a solution of piperidine 
(20%) in DMF. The mixture was then agitated for 2 hours at 
room temperature. The solvent was filtered off and the resin 
was rinsed with DMF (2x5 mL) , dichloromethane (2x5 mL) and 
methanol (2x5 mL) . 



20 



25 
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Fmoc-Glu-OAll coupled to NovaSvn TG resin 



+ Fmoc-Glu-OAH (^Vr~^^ (^)h PEG |- 



Giu(OAll)-H 



NovaSyn TG amino resin 130 p (0.345 g, load 0.29 ramol/g, 0.1 
mmol) was suspended ad swollen into 2 mL of DMF, under 
agitation (1000 rpm) for 30 minutes. The solvent was then 
removed and a solution of Fmoc-Glu-OAll (0.123g, 0.3 mmol, 3 

10 eq) , TBTU (0.096g, 0.3 mmol, 3 eq) and DPEA (0.052 mL, 0.3 

mmol, 3 eq) in 3 mL of DMF was added to the swollen resin. The 
resulting mixture was agitated at 1000 rpm, room temperature, 
overnight. The reaction mixture was filtered and the resin 
rinsed with DMF (2x2 mL) , dichloromethane (2x5 mL) and methanol 

15 (2x5 mL) . 

The resin was then acetylated (see General procedure for 
acetylation) and deprotected (see General procedure for Fmoc 
deprotection) . 



HO-Glv-Fmoc coupled to P-Glu (PAID -H 



N HO-Gly-Fmoc n 

( V rPEG> Glu(OAll)-Fmoc (H 



peg l -Glu(OAIl)-Glv-Fmoc 



o 



PEG> Glu(OAII)-Glv-H 



The resin was suspended in 2 mL of DMF and let swollen for 
30 minutes at room temperature, under agitation (1000 rpm) . DMF 
was removed and a solution of Fmoc-Gly-OH (0.089 g, 0.3 mmol, 3 
eq) , TBTU (0.096 g, 0.3 mmol, 3 eq) and DPEA (0.052 mL, 0.3 
30 mmol, 3 eq) in 3 mL of DMF was added to the resin. The mixture 
was shaken at room temperature for 12 hours. The reagents were 
then removed by filtration and the resin was rinsed with DMF 



m 
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(2x5 mL) , dichloromethane (2x5 mL) and methanol (2x5 mL) . The 
resin was then acetylated (see General procedure for 
acetylation) and deprotected (see General procedure for Fmoc 
-depirolre-e- t ion ^ — t-a-a4ford t h e r e sin-b o und dip e ptid e P-Glu(OAll) — 
Gly-H. 



10 



Split & Mix procedure for the resin bound hexapeptide P- 
Glu (PAID -GIv-X tXoX^ -H sublibrary 



split AAs combine AC2O 



TBTU, 
DPEA 



pyr 
piperidine 



r^y TTE^ Gtu(OAII^^ 



15 



20 



25 



30 



The resin was suspended in 3:1 mixture of 1 , 2-dichloroethane 
and DMF and equally partitioned into 17 4 mL Alltech tubes. 
Each tube thus contained 0.1/17 mmol = 5.88 10" 6 mol of resin- 
bound dipeptide. Solvent was removed in vacuo, and the resin 
was suspended in 200 mL of DMF and agitated for 30 minutes. The 
17 aminoacids (1.76 10 -5 mmol, 3 eq for each step, 7.04 10 -5 
mmol for 4 steps) were weighed into 17 vials: 



1 . 


Fmoc -Ala -OH 


22 


mg 


2. 


Fmoc-Asn-OH 


25 


mg 


3. 


Fmoc-Asp (OtBu) -OH 


29 


mg 


4 . 


Fmoc-Gln-OH 


26 


mg 


5. 


Fmoc-Glu (OtBu) -OH 


30 


mg 


6. 


Fmoc-Gly-OH 


21 


mg 


7 . 


Fmoc-Ile-OH 


25 


mg 


8 . 


Fmoc-Leu-OH 


25 


mg 


9. 


Fmoc-Lys (BOC) -OH 


33 


mg 


10. 


Fmoc -Met -OH 


.26 


mg 


11 . 


Fmoc-Phe-OH 


27 


mg 


12 . 


Fmoc-Pro-OH 


24 


mg 


13. 


Fmoc-Ser (tBu) -OH 


27 


mg 


14 . 


Fmoc-Thr (tBu) -OH 


28 


mg 



4 



17 . 



16. 



15. 



Fmoc-Trp (BOC) -OH 
Fmoc-Tyr ( tBu) -OH 



Fmoc-Val-OH 



3 7 mg 
32 mg 
24 mg 



5 Each amino acid was dissolved in 2 mL of DMF; 0.5 mL of this 
solution (corresponding to 1.76 10" 5 mmol, 3 eq of each amino 
acid) was added to the appropriate tube. TBTU (1.76 10' 5 mmol 
x 17 = 2.99 1CT\ 96 mg) and DPEA (1.76 10" 5 mmol x 17 = 2.99 10" 
\ 52 mL) were separately dissolved in 1.7 mL of DMF and each 
10 solution was evenly partitioned, addressing 3 eq of each 
reagent, to each one of the 17 tubes. 

The reaction tubes were agitated at room temperature for 12 
hours, then the reagents and solvents were removed in vacuo and 

15 the resin was rinsed with DMF (2x1 mL each tube), DCM (2x1 mL 
each tube) and methanol (2x1 mL each tube) . The resin was then 
suspended in 3:1 mixture of 1 , 2-dichloroethane and DMF and 
recombined. The recombined resin was acetylated (3 mL of 
acetylating reagent, 1 hour, room temperature) and deprotected 

20 (3 mL of 20% piperidine in DMF, 2 hours, room temperature) . 

The procedure was repeated 3 more times to afford the 
deprotected resin-bound hexapeptide G (OA11 ) EX a X 2 X 3 X 4 H . 

25 Coupling of Fmoc (Alloc) Lvs-OH to resin -bound hexapeptide 



30 The resin was suspended in 2 mL of DMF and was let swollen for 
10 minutes at room temperature, under agitation (1000 rpm) . DMF 
was removed and a solution of Fmoc (Alloc) Lys-OH (0.135g, 0.3 
mmol, 3 eq) , TBTU (0.096g, 0.3 mmol, 3 eq) and DPEA (0.052 mL, 
0.3 mmol, 3 eq) in 3 mL of DMF was added to the resin. The 

35 mixture was shaken at room temperature for 12 hours. The 



OAII 



OAII 



Alloc 




# 



5 
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reagents were then removed by filtration and the resin was 
rinsed with DMF (2x5 mL) , dichloromethane (2x5 mL) and methanol 
(2x5 mL) . The resin was then acetylated (see General procedure 
-for — ac e t ylat ion ) — t o afford the resin bound heptapep tid e . 

Deorotection of Allvl ester 



OAII Alloc 

I I PhSiH 3 / "\ , , 

PEGl-Giu-Gly-X 1 ^X 3 X 4 -L}sFmoc V ( H PEG 1 -Glu-Glv-XjXgX^LvsFmoc 

pd[pph 3 j 4 

The resin was suspended in 2 mL of DCM. Phenylsilane (2.4 mmol, 
24 eq, 0.29 mL) in 1 mL of DCM was added and the mixture was 
shaken at room temperature for 10 minutes. Pd[PPH 3 ] 4 (0.01 
mmol, 0.1 eq, 11 mg) in 0 . 5 mL of DCM was added and the 
15 reaction mixture was shaken for further 10 minutes. 

The reagents were filtered and the resin was rinsed with DCM 
(2x5 mL) and methanol (2x5 mL),then the procedure was repeated 
once again. The resin was finally dried under reduced pressure. 




20 
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Coupling of NIO-Fmoc protected PBD to lysine residue of the 
resin-bound heptapeptide 




ZlGD-Glu-Gly-Xt^X^LysFmoc 



5 




fH] - Gl u-G ly -X^X^Ly sF moc 



Fmoc 



OH 



O 

MeO 




O 



The resin was suspended in 2 mL of DMF and agitated for 10 
minutes to allow swelling. DMF was then removed. A solution of 
NIO-Fmoc-PBD acid (0.556 g, 1 mmol, 10 eq. ) , TBTU (0.353 g, 1.1 

10 mmol, 1.1 eq.) and DPEA (0.19 mL, 1.1 mmol, 1.1 eq.) in 3 mL of 
DMF was stirred for 30 minutes, and added to the resin. The 
mixture was agitated at room temperature for 12 hours, then the 
reagents were removed under reduced pressure and the resin was 
rinsed with DMF (2x5 mL) , dichloromethane (2x5 mL) , methanol 

15 (2x5 mL) and diethyl ether (1x2 mL) . 

Activation of resin with 3-hvdroxv-l , 2 , 3-benzotriazin-4 ( 3H) -one 
(HO-Dhbt) 

20 



The resin was suspended and swollen in 2 mL of DMF for 10 
25 minutes. DMF was removed and the resin was treated with 156 mL 
(1 mmol, 10 eq) of diisopropyl carbodiimide (DIC) in 2 mL of 



PBD(Fmoc) 



O-Dhbt 



PBD(Fmoc) 




DMF. The resulting slurry was shaken for 15 minutes at -15°C 
(acetone-ice bath) , then washed with DMF (2x2 mL) . 



Ttre-reg in was re su -s-pe- nd c d in 2 mL of BN-F-7 — cooled at -10°C and 

5 treated with 163 mg (1 mrnol, 10 eq) of HO-Dhbt . The mixture was 
shaken for 30 minutes at -10°C, for 4 hours at 0°C and then 
allowed to stand at 0°C overnight. 

The reaction mixture was filtered and the resin washed with DMF 
10 (2x2 mL) , dichloromethane (2x2 mL) , MeOH (2x2 mL) and diethyl 
ether (2 mL) . The activated resin was stored in freezer until 
ready for use. 

Coupling of O-Dhbt activated resin with seco-CBL in presence 
15 of DPEA 



20 

BOC-seco-CBI was deprotected immediately before coupling to the 
activated resin. 100 mg (0.3 mmol) of BOC-seco-CBI was 
dissolved in 15 mL of 3M HC1 solution in anhydrous ethyl 
acetate. The mixture was cooled at 0°C for 30 minutes and then 
25 allowed to warm to room temperature for further 30 minutes. 
TLC (4:1 = petroleum ether : diethyl ether) revealed reaction 
completion; the reaction mixture was evaporated at reduced 
pressure and stored at -20°C. 

30 O-Dhbt-activated resin was treated with seco-CBI dissolved in 2 
mL of DMF and the mixture was shaken for 60 minutes at room 
temperature. At the end the reaction mixture was filtered and 
the resin was washed with DMF (2x2 mL) , dichloromethane (2x2 
mL), methanol (2x2 mL) and diethyl ether (2x1 mL) . The resin 



O-Dhbt 



PBD(Fmoc) 



CBI 



PBD(Fmoc) 




was dried under reduced pressure and stored in a cool, dry 
place . 

t-ButvJ- and BOO deprotection on side chains 

5 

The resin was treated with a solution of TIS/TFA 2% in 
dichloromethane (2 mL) . The mixture was shaken for 2 hours at 
room temperature. The reagents were removed in vacuo and the 
resin was rinsed with dichloromethane (2x5 mL) and methanol 
10 (2x5 mL) . 

Fmoc deprotection 



The resin was treated with 3 mL of a solution of piperidine 
(20%) in DMF, and agitated at room temperature for 2 hours. At 
the end the mixture was filtered and the resin was washed with 
20 DMF (2x5 mL) , dichloromethane (2x5 mL) and methanol (2x5 mL) . 
The resin was then dried under reduced pressure. 



CBI 



PBD(Fmoc) 



CBI 



PBD 





CLAIMS 



1. A compound of formula I: 




Y 



wherein : 

X is an electrophilic leaving group; 
Y is selected from NH-Prot, O-Prot, S-Prot, N0 2 , 
NHOH, N 3/ NHR, NRR, N=NR, N(0)RR, NHS0 2 R, N=NPhR, SR or 
10 SSR, where Prot represents a protecting group; 

A and B collectively represent a fused benzene or 
pyrrole ring (in either orientation), which is optionally 
substituted by up to respectively 4 or 2 groups 
independently selected from R, OH, OR, halo, nitro, amino, 
15 Me 3 Sn, C0 2 H, C0 2 R; 

R a is a nitrogen protecting group, where if Y 
includes a protecting group, these protecting groups are 
orthogonal ; 

R 2 and R 7 are independently selected from H, R, OH, 
20 OR, halo, nitro, amino, Me 3 Sn; 

wherein R is selected from: 

(a) a lower alkyl group having 1 to 10 carbon 
atoms, 

(b) an aralkyl group (i.e. an alkyl group with 
25 one or more aryl substituents ) , preferably of up to 

12 carbon atoms; 

the alkyl group of (a) or (b) optionally 
containing one or more carbon-carbon double or triple 
bonds, which may form part of a conjugated system; 
30 and 



m 





10 
15 

2. 

20 

3. 

25 
30 
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(c) an aryl group, preferably of up to 12 carbon 
atoms; 

and wherein: 

R is optionally substituted by one or more halo, 
hydroxy, amino, or nitro groups, and optionally 
contains one or more hetero atoms, which may form 
part of, or be, a functional group; 

except that when is Boc, Y is N0 2 , X is CI, and R 2 
and R 7 are H, then A and B do not collectively represent 
either an unsubst i tuted benzene ring or: 



A compound according to claim 1, wherein R is 
independently selected from a lower alkyl group having 1 
to 10 carbon atoms, or an aralkyl group, preferably of up 
to 12 carbon atoms, or an aryl group, preferably of up to 
12 carbon atoms, optionally substituted by one or more 
halo, hydroxy, amino, or nitro groups. 

A compound according to claim 2, wherein R is 
independently selected from lower alkyl groups having 1 to 
10 carbon atoms optionally substituted by one or more 
halo, hydroxy, amino, or nitro groups. 

A compound according to claim 3, wherein R is an 
unsubstituted straight or branched chain alkyl group, 
having 1 to 10 carbon atoms. 




m # 
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A compound according to any one of the preceding claims, 
wherein R 3 has a carbamate functionality where it binds to 
the nitrogen atom of the CPI . 



5 6. A compound according to any one of the preceding claims, 
wherein Y is NH-Prot, O-Prot or S-Prot. 

7. A compound according to claim 6, wherein Y is NH-Prot. 

10 8. A compound according to any one of the preceding claims, 
wherein X is either halogen or OS0 2 R. 

9. A compound according to any one of the preceding claims, 
wherein the 4,5 fused ring is substituted by -C0 2 R in the 

15 2 or 3 position if it is a benzene ring, or in the 2 

position if it is a pyrrole ring. 

10. The use of compounds of formula I: 

X 



20 




(I) 



wherein : 

X is an electrophilic leaving group; 
Y is selected from NH-Prot, O-Prot, S-Prot, N0 2 , 
NHOH, N 3 , NHR, NRR, N=NR, N(0)RR, NHS0 2 R, N=NPhR, SR or 
25 SSR, where Prot represents a protecting group; 

A and B collectively represent a fused benzene or 
pyrrole ring (in either orientation) , which is optionally 
substituted by up to respectively 4 or 2 groups 
independently selected from R, OH, OR, halo, nitro, amino, 
30 Me 3 Sn, C0 2 H, C0 2 R; 
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R } is a nitrogen protecting group, where if Y 
includes a protecting group, these protecting groups are 
orthogonal ; 



R : . and R 7 are independently selected trom H, R^ UFT, 
5 OR, halo, nitro, amino, Me 3 Sn; 

wherein R is selected from: 

(a) a lower alkyl group having 1 to 10 carbon 
atoms, 

(b) an aralkyl group (i.e. an alkyl group with 
10 one or more aryl subs ti tuents ) , preferably of up to 

12 carbon atoms ; 

the alkyl group of (a) or (b) optionally 
containing one or more carbon-carbon double or triple 
bonds, which may form part of a conjugated system; 
15 and 

(c) an aryl group, preferably of up to 12 carbon 
atoms ; 

and wherein: 

R is optionally substituted by one or more halo, 
20 hydroxy, amino, or nitro groups, and optionally 

contains one or more hetero atoms, which may form 
part of, or be, a functional group; 
in methods of combinatorial chemistry synthesis, wherein 
the compound of formula I is joined to a solid support by 
25 a chain comprising at least one combinatorial unit. 

11. A compound of formula III: 

X 



N -^-hrL-0 
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wherein : 

X, Y, A, B, R 2 and R 7 are as defined in claim 10; 

T is a combinatorial unit; 

n^^™a^o^d-iri^e^-irt^^-er^ — w her e if n — i-s gr e at e r 

than 1, each T may be different; 

L is a linking group, or less preferably a single 
bond; and, 

O is a solid support. 



10 12 



15 



A compound of formula III' 

R 3 




-t-T-fc-L-O 



R 7 



(III 1 ) 



wherein : 

A, B, R 2 , R 7 , T, n, L and O are as defined in claim 11; 
and, 

Y' is NH, O or S. 



20 



13. A compound of formula II 

X 




(II) 



wherein : 

X, Y, A/ B, R 2/ 



R 7 , T and n are as defined in claim 11. 



60 



14. A compound of formula II' 

R 2 




. (II') 

b' yr r 7 

r 

wherein : 

A, B, T, n, R 2 , R-, and Y 1 are as defined in claim 12 



15. A compound of formula V: 



10 



Ch-H-Htn 




(V) 



wherein : 

A, B, Y, R lf R 2/ and R 7 , are as defined in claim 10; and, 
T, n, L and O are as defined claim 11. 



15 



20 



16. A compound of formula V 




(V) 



wherein : 

A, B, R lt R 2 , and R 7 are as defined in claim 10; and, 
T, n, L, Y 1 and O are as defined in claim 12. 




* 
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17 . 



A compound of formula IV: 



X 



B 




(IV) 



18. 



10 



wherein : 

A, B, X, Y, R lf R 2 and R 7 are as defined in claim 10; and, 
T and n are as defined in claim 11. 

A compound of formula IV : 



wherein : 

A, B, T, n, R lr R 2 and R 7 are as defined in claim 17; and, 
Y 1 is NH, O or S. 

A method of preparing a compound according to claim 11 by 
reaction of a compound of formula VI: 




(IV) 




with a compound of formula I: 



# 
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wherein : 

A, B, R 2 , R 7 / T, n, L and O are as defined in 
5 claim 11; and, 

W is H or an atom or group for providing a 
functional group capable of reaction with -NH 2 . 

20. A method of preparing a compound according to claim 15, by 
10 reaction of a compound of formula VI: 



15 




w 



(VI) 



with a compound of formula I according to claim 9, 
wherein: 

T, n, L and 0 are as defined in claim 15; and, 
W is H or an atom or group for providing a functional 
group capable of reaction with -COOH. 



20 



25 



21. A compound of formula VII 




(VII) 



wherein : 

O, T, and L are as defined in the claim 11; 

n and m are positive integers, or one of them may be 

zero; 

T' is a combinatorial unit, where each T" may be 
different if m is greater than 1; 




* 
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10 



15 



20 



wherein A, B, Y, R w R 2 and R 7 are as defined in 
claim 10 and Y' is NH, O or S; 



wherein: 

X' is selected from CO, NH, S, or O; 
G is O, S, NH, or a single bond; 

R' 2 and R' 3 are independently selected from: H, 
R, OH, OR, =0, =CH-R, =CH 2 , CH 2 -C0 2 R, CH 2 -C0 2 H, 
CH 2 -S0 2 R, 0-S0 2 R, C0 2 R, COR and CN, and there is 
optionally a double bond between C 2 and C 3 ; 

R' 6 , R' 7 / and R' 9 are independently selected from 
H, R, OH, OR, halo, nitro, amino, Me 3 Sn; 

R'u is either H or R; 

Q' is S, O or NH; 

R' 10 is a nitrogen protecting group; 

Y" is a divalent group such that HY = R; 

p is a positive integer, where if p is greater 
than 1, for each repeating unit, the meaning of T, 
T' , T" and D and the values of n and m are 
independently selected; and, 

E is selected from the same possibilities as D; 

provided that at least one group D or E is 
selected from (a) . 



(b) 
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22. A compound of formula (VIII) : 





D 


-F (VIII) 






p 



wherein : 

5 L, T, T" , T", D, E, n, m and p are as defined in claim 21. 



23. A compound of formula (IX) : 



10 



15 



20 




H 

T" 



(IX) 



wherein : 

O, L, T, T' , T", n, m and p are as defined in 
claim 21; 

T' " is a combinatorial unit; 

q is a positive integer, where if q is greater than 
1, each T'" may be different; and, 

E a is selected from the group (a) of E as defined in 
claim 21; 
wherein: 

if p is greater than 1, for each repeating unit the 
meaning of T, T' , T", T"' and the values of n, m and q are 
independently selected. 
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24. A compound of formula (X) : 



H 
T" 



H- 



(X) 



wherein : 

5 L, T, T' , T", T'", E a , n, m, p and q are as defined in 

claim 23 . 

25. A collection of compounds all of which are represented by 
either : 

10 (i) formula III as defined in claim 11; 

(ii) formula III' as defined in claim 12; 

(iii) formula II as defined in claim 13; 

(iv) formula II' as defined in claim 14; 

(v) formula V as defined in claim 15; 
15 (vi) formula V' as defined in claim 16; 

(vii) formula IV as defined in claim 17; 

(viii) formula IV' as defined in claim 18; 

(ix) formula VII as defined in claim 21; 

(x) formula VIII as defined in claim 22; 

20 (xi) formula IX as defined in claim 25; or, 

(xii) formula X as defined in claim 24. 



25 



30 



26. A method of preparing a collection of compounds as defined 
in claim 25. 

27. A method of screening compounds of: 

(i) formula II as defined in claim 13; 

(ii) formula II' as defined in claim 14; 

(iii) formula IV as defined in claim 17; 

(iv) formula IV as defined in claim 18; 

(v) formula VIII as defined in claim 22; or, 
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(vi) formula X as defined in claim 24; 
to discover biologically active compounds. 



9R . ThP n.sp of a compound of: 

5 (i) formula II as defined in claim 13; 

(ii) formula II' as defined in claim 14; 

(iii) formula IV as defined in claim 17; 

(iv) formula IV as defined in claim 18; 

(v) formula VIII as defined in claim 22; or 
10 (vi) formula X as defined in claim 24; 

in the manufacture of a cytotoxic, antibiotic, 
antiparasitic or antiviral therapeutic composition. 

29. The use of a compound of: 

15 (i) formula III as defined in claim 11; 

(ii) formula III' as defined in claim 12; 

(iii) formula V as defined in claim 15; 

(iv) formula V as defined in claim 16; 

(v) formula VII as defined in claim 21; or, 
20 (vi) formula, IX >as defined in claim 23; 

in a method of diagnosis. 

30. The use of a compound of: 

(i) formula II as defined in claim 13; 
25 (ii) formula II' as defined in claim 14; 

(iii) formula IV as defined in claim 17; 

(iv) formula IV as defined in claim 18; 

(v) formula VIII as defined in claim 22; or, 

(vi) formula X as defined in claim 24; 

30 in a method of target validation or functional genomics. 
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